errors in statistical analysis and questionable randomization lead to unreliable conclusions
نویسندگان
چکیده
dear editor, we read with interest the paper, “the effect of food service system modifications on staff body mass index in an industrial organization”[1]. we noticed several substantial issues with data and calculations, calling into question the randomized nature of the study and validity of analyses.the distribution of baseline weight was significantly differentbetween groups (p-value = “0.00”). we replicated the test using reported means and standard deviations (sds) andobtained a p-value of approximately 1.9*10 -17 . it is extraordinarily unlikely that any variable would be that different between two groups if allocation was truly random. even it was truly random, the stated method of “the samples were randomly divided into two groups”[1] does not describe the “method used to generate the random allocation sequence” and the “type of randomization; details of any restriction (such as blocking and block size)” details specified by consolidated standards of reporting trials (consort)[2].given the large difference in baseline weights, it is unusual that the difference in baseline body mass index (bmi) between groups is not more significant (p=0.032), raising the question of what the groups’ distributions of height were. both groups have 30 males (58.8%), so sex differences are unlikely to explain this discrepancy. height was not explicitly reported, but it was possible to estimate height utilizing geometric means from body weight and bmi[3,4]. we calculated the baseline control group geometric mean as 2.04 cm taller than the test group. these calculations also suggest the control group shrunk by 1.26 cm while the test group grew by 1.52 cm over the study. neither change is explained by rounding error nor seems plausible for adult subjects over 40 days.because there were no sds of the change scores reported, we were unable to replicate the reported p-value (0.318) for the between-group test of weight change exactly. however, we were able to consider the pre and post-intervention sds and calculate possible sds of within-group change scores for a range of pre-post correlations. the largest p-value possible was 0.1282, calculated when each group had perfect negative pre-post correlation (correlation=-1), which is unlikely. if there was no or a positive correlation, the p-value would be much smaller (p=0.0449 when correlation=0 for each group) and plausibly indicates a significant difference between groups. therefore, although the published results are impossible the correct analysis could make the intervention appear more effective than reported.the results section describes an initial sample size of 116 with 14 dropping out (p. 115). the tables report the remaining sample size to be 102, but the body of the text reports 101 subjects remained until study completion. it is unclear which values were correct; this lack of clarity also fails consort guidelines[2].considering that the reported findings are essentially impossible given the stated study design, we encourage the authors to explain the treatment allocationand make the raw data available, or the journal to act according to the committee on publication ethics[5] in situations where findings are unreliable
منابع مشابه
Errors in statistical analysis and questionable randomization lead to unreliable conclusions.
Dear Editor, We read with interest the paper, “The effect of food service system modifications on staff body mass index in an industrial organization”[1]. We noticed several substantial issues with data and calculations, calling into question the randomized nature of the study and validity of analyses. The distribution of baseline weight was significantly differentbetween groups (p-value = “0.0...
متن کاملQuestionable conclusions on cannabis and crime.
In a paper published recently in this journal [1], Pedersen & Skardhamar examine whether cannabis users are at increased risk of being charged with criminal offences. They find an association between frequency of cannabis use in one year and risk of criminal charges in subsequent years. The association is statistically significant with respect to drug-specific crime only, and not to other types...
متن کاملRandomization in clinical trials: conclusions and recommendations.
The statistical properties of simple (complete) randomization, permuted-block (or simply blocked) randomization, and the urn adaptive biased-coin randomization are summarized. These procedures are contrasted to covariate adaptive procedures such as minimization and to response adaptive procedures such as the play-the-winner rule. General recommendations are offered regarding the use of complete...
متن کاملErrors in the Statistical Analysis
Egri et al. (2012) investigated how the attractiveness of substrates to tabanids (horseflies) varied when painted with different numbers, orientation, and density of stripes. The basic experimental design was a randomized complete block design where the number of flies attracted was counted at regular intervals. The authors analyzed this data using a complete-randomized design ANOVA which is in...
متن کاملStatistical analysis of timing errors.
Human rhythmic activities are variable. Cycle-to-cycle fluctuations form the behavioral observable. Traditional analysis focuses on statistical measures such as mean and variance. In this article we show that, by treating the fluctuations as a time series, one can apply techniques such as power spectra and rescaled range analysis to gain insight into the mechanisms underlying the remarkable abi...
متن کاملمنابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
journal of paramedical sciencesجلد ۶، شماره ۳، صفحات ۰-۰
میزبانی شده توسط پلتفرم ابری doprax.com
copyright © 2015-2023